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(54) Information recording and reproduction 



(57) A recording apparatus configured to receive 
analog video and audio signals, digitize and compress 
the same, and record the compressed audio and video 
signals on a digital recording medium such as an optical 
disk. As the audio and video signals are received and 
recorded, time segments of the audio signals are ana- 
lyzed for certain features such as whether the time seg- 
ment corresponds to instrumental music, vocal music or 
conversational speech. A table of contents is then gen- 



erated corresponding to the feature analysis and digit- 
ally stored on the storage medium. As a result, the re- 
corded audio is characterized over time, e.g. , on a frame 
by frame basis. A high degree of versatility is thereby 
provided in the playback process, such as the ability to 
skip portions having certain audio types or to quickly 
scroll to desired portions of the recorded audio pro- 
grams. Reproducing apparatuses enable the user to se- 
lectively reproduce segments of the recorded audio and 
video. 
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Deacriptlon 

The present invention relates to information record- 
ing and reproduction, and more particularly, to recording 
apparatus, recording methods, reproduction apparatus 
and reproduction methods. The method and apparatus 
are applicable to audio or audio and video signals. 

Presently, analog-type video tape recorders (VTRs) 
are commonly used to record and reproduce analog vid- 
eo and audio signals of a television broadcast. It is con- 
templated that digital data corresponding to such analog 
video and audio signals will be commonly recorded on 
a digital storage medium such as an optical disk or a 
magnetic hard disk. 

With conventional VTRs, reproduction is facilitated 
by introducing various identifying signals during the re- 
cording process. For example, one type of identifying 
signal is used to identify whether a television broadcast 
is a bilingual broadcast or a stereo broadcast. As a re- 
sult, a television receiver can discriminate between 
these two types of broadcasts and control the audio sig- 
nal output method accordingly. 

Contemporary optical disks and hard disks have su- 
perior accessibility, i.e., random access capability, as 
compared to analog magnetic tapes. As such, various 
fast viewing and listening methods are now being con- 
sidered for these disks, such as speech speed conver- 
sion and selective skipping of song contents, in contrast, 
conventional VTRs lack such capability some prbr art 
VTRs include an automatic audio selection function or 
the like, while others have a speech speed conversion 
feature. A drawback of the speech speed conversion 
feature, however, is that video and audio are processed 
independently This Is problematic in that the output au- 
dio and video may become unsynchronized. resulting in 
unnatural audiovisual output, e.g., lips moving before or 
after the audio is produced. 

Conventional laser disc players CLDPs") are capa- 
ble of header search for karaoke use (i.e., for in use in 
a sing-along machine). However, in the case of ordinary 
broadcasts, users in many instances desire to view con- 
versational-type programs between musical perform- 
ance programs. In these cases, the conventional LDP, 
which is capable only of header search, is inadequate. 

Aspects of the invention are specified in the claims 
to which attention is invited. 

An embodiment of the present invention seeks to 
provide a recording apparatus capable of recording an 
audio or audiovisual signal on a digital storage medium, 
while concomitantly analysing its characteristics over 
time for particular audio types and storing information 
indicative of such characteristics on the storage medi- 
um. 

A further embodiment of the invention seeks to pro- 
vide a reproduction apparatus that allows for selective 
reproduction of the audio or audbvisuat signal so re- 
corded based on user selection of a particular audio 
type. 



Another embodiment of the invention seeks to pro- 
vide recording and reproduction apparatuses with en- 
hanced features. 

In an illustrative embodiment of the invention, there 
s is provided an information recording apparatus for re- 
cording at least an audio signal onto a recording medi- 
um, which includes detection circuitry for detecting a 
feature of the audio signal, and recording circuitry for 
recording together with the audk) signal additional infor- 
mation that corresponds to the detected feature. Pref- 
erably, features of the audio signal are detected in a 
time-segmented manner, such that segments or frames 
of the audb signal are each characterized. For example, 
features that may be detected by the detection circuitry 
may include: whether a given segment comprises mut- 
ed audio; whether it comprises music; or whether it com- 
prises conversational speech. 

With the feature information stored on the recording 
medium, versatility during reproduction is advanta- 
geously possible, thereby providing the user with a high- 
ly versatile tool during playback. For instance, the user 
is able to skip portions of the recorded material having 
an undesired audio type or types, or to quickly locate a 
desired portion of the recorded material by selective 
skipping based on audio types. 

In another illustrative embodiment, there is provid- 
ed an information reproduction apparatus for reproduc- 
ing at least an audio signal corresponding to audio data 
recorded on a recording medium on which additional in- 
formation relating to at least the audio signal is also re- 
corded. The apparatus includes reading means for 
reading out a portion of the additional infomnation prior 
to any reproduction of a corresponding portion of the au- 
dio signal; determining means for determining whether 
to reproduce the corresponding portion of the audio sig- 
nal In accordance with the read-out portion of the addi- 
tional information and a current operating mode; and 
control means for controlling reproduction of the corre- 
sponding portion of the audio signal in accordance with 
a determination by the determining means. 

The following detailed description, given by way of 
example and not intended to limit the present invention 
solely thereto, will best be appreciated in conjunction 
with the accompanying drawings, in which like reference 
numerals denote like elements and parts, wherein: 

FIG. 1 is a block diagram of an illustrative configu- 
ration of an information recording apparatus ac- 
cording to an embodiment of the present invention; 
FIG. 2 illustrates an illustrative arrangement of stor- 
age regions on a disk; 

FIG. 3 is a flowchart showing the operation of the 
information recording apparatus of FIG. 1; 
FIGS. 4 and 5 are flowcharts showing a process of 
generating a subcode indicative of an audk) feature; 
FIGS. 6 and 7 are timing diagrams showing output 
timing of signals flowing within the respective 
processing systems of FIG. 1 ; 
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FIG. 8 is a block diagram showing an illustrative 
configuration of an infonmation reproduction appa- 
ratus according to an embodiment of the invention; 
FIG. 9 is a flowchart showing the operation of the 
information reproductbn apparatus of FIG. 8; 
FIG. 10 is a timing diagram showing output timing 
of signals flowing within the respective processing 
systems of FIG. 8; 

FIG. 11 is a block diagram showing an illustrative 
configuration of an Infonmation reproduction appa- 
ratus according to another embodiment of the in- 
vention; 

FIG. 12 is a flowchart illustrating the operation of 
the infonnation reproduction apparatus of FIG. 11; 
and 

FIG. 13 is a timing diagram showing output timing 
of signals flowing within the respective processing 
systems of FIG. 11. 

FIG. 1 is a block diagram of a first illustrative em- 
bodiment of the present invention, designated as re- 
cording apparatus 1 00. As will be described in detail be- 
low, recording apparatus 1 00 is configured to selectively 
receive various types of analog input signals, such as a 
television broadcast signal or a camera system output 
signal. The apparatus converts the selected input signal 
to a digital signal, and compresses and records the 
same on a digital storage medium such as an optical or 
magnetic disk. As the audio and video signals are re- 
ceived and recorded, characteristics of the audio signal 
are analyzed over time so as to categorize its contents 
in a time-segmented manner. In particular, individual 
frames of the audio signal are analyzed to determine 
which frames or frame sequences correspond to. e.g., 
music, conversational speech, or muted audio. Each 
segment of the recorded audio program is thereby cat- 
egorized. A user table of contents is then generated cor- 
responding to the categorization of the audio signal. The 
table of contents is recorded onto the digital storage me- 
dium, either in a specific region of the recording medium, 
or distributed as subcodes in the same regions as the 
recorded audioA^ideo data. The table of contents allows 
a user to play back a selected type of audio and asso- 
ciated video data while skipping other types, or to quick- 
ly access desired portions of the recorded audiovisual 
program by selective skipping of certain audio types, 
and so forth. 

Recording apparatus 100 will now be described in 
detail. Video signal processing system 1 is configured 
to receive an external input video signal, such as a VTR 
video output, and perform various kinds of processing 
on the signal such as automatic gain control (AGC). A 
camera signal processing system 2 operates to receive 
a video signal from a charge coupled device (CCD) cam- 
era or the like and convert it into a standard protocol 
signal such as a National Television System Committee 
(NTSC) video signal. Tuner system 3 receives a televi- 
sion broadcast signal via an antenna system (not 



shown), and converts a selected channel of the televi- 
sion signal into video and audio signals through video 
detection, video amplification and audio detection. 
Audio signal processing system 7 is adapted to re- 
5 ceive and amplify an external audio signal, e.g., the au- 
dio output from the VTR supplying the video signal to 
system 1 . A microphone input audio processing system 
8 amplifies an audio signal inputted through a micro- 
phone and performs AGO processing thereon. 
10 The video output signals from each of systems 1 , 2 
and 3 are applied as inputs to video signal switching sys- 
tem 4, which switches a selected one of the video sig- 
nals to its output in accordance with a selection control 
signal from system controller 14. Likewise, audio signal 
switching system 9 routes the selected one of the audio 
signals from systems 3, 7 and 8 to its output based on 
the control signal from system controller 14. 

In the video path, the analog video output of switch- 
ing system 4 Is applied to video signal A/D conversion 
system 5 where it is converted to a digital video signal 
and then quantized. The quantized, digital video signal 
Is then compressed by video compressing and process- 
ing system 6 in accordance with a standard compres- 
sion protocol such as the joint photographic experts 
group (JPEG) or the moving picture experts group 
(MPEG) schemes. The compressed video signal is ap- 
plied to recording data processing system 17 and re- 
corded in recording medium 18 as will be discussed 
more fully below. 

In the audio path of recording apparatus 100, the 
analog audk> output of audio switching system 9 is con- 
verted to digital audio signal by audio signal A/D con- 
version system (A/D converter) 10. The digitized audio 
output from A/D converter 1 0 is applied to both an audio 
features extraction system 12 (detecting means) and to 
an audio signal band compression system 11 , the latter 
of which compresses the audio when necessary in ac- 
cordance with a standard protocol such as MPEG. 

Audio features extraction system 12 includes 
processing circuitry to analyze certain characteristics of 
the digital audio signal applied thereto from system 10, 
to thereby extract audio features from the signal. The 
quantized audio signal is quadrature - transformed in ex- 
traction system 12 based on operating parameters sup- 
plied thereto from system controller 14, and then sub- 
jected to a specified operation in accordance with an op- 
erating command also supplied by system controller 14. 
The audio signal is analyzed in extraction system 12 on 
a block by block basis, where each block corresponds 
to a specific time segment (e.g., frame or set of frames) 
of the audio signal to be recorded. By way of example, 
to determine which portions of the audio signal corre- 
spond to a mute condition, the audio signal may be an- 
alyzed In 0.02 second blocks to determine which blocks 
contain muted or low level audio. The audio signal is 
analyzed over larger blocks of time to determine which 
of the larger blocks contain audio corresponding to. e. 
g.. instrument music, human speech or vocal music. 
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Based on the results of the analysis performed by ex- 
traction system 12, subcodes are generated by a sub- 
codes generation system 13 to characterize each such 
block of the audio signal. Certain subcodes are tempo- 
rarily stored within memory 1 6. s 

In particular, for each audio block of duration "Dl" 
(e.g.. 0.02 seconds duration), a subcode "A" is gener- 
ated as indicative of whether or not that block corre- 
sponds to muted audio. For each block of a longer du- 
ration "D2", a subcode "B" is generated which is indic- io 
ative of the type of audio contained in that block, e.g., 
conversation, instrument music or vocal music. Sub- 
codes A are directly transferred to recording data 
processing system 17, whereas subcodes B are trans- 
ferred to memory circuit 1 6 for temporary storage there- is 
in. Typically, when recording of audio/video data is com- 
plete, all subcodes B are transferred as a block from 
memory 16 to recording data processing system 1 7 (via 
subcodes generation system 1 3) under the control of 
system controller 1 4. 

In any event, as the analog audio signal is received 
by recording apparatus 100, it is digitized, compressed 
and recorded as data, generally in real time, on a pre- 
determined portion of the recording medium 18. As the 
subcodes A and B are generated, a user table of con- ^5 
tents (U-TOC) is generated to correlate the audio data 
being stored on recording medium 1 6 with the subcodes 
characterizing the respective segments of the audio da- 
ta. The U-TOC is stored on recording medium 18. As 
shown in FIG. 2, the digitized audio data may be record- 30 
ed on the outermost region of the disk, and the U-TOC 
data may be recorded on a predetermined area of the 
disk outside the innermost region where a table of con- 
tents (TOC) is recorded. 

System controller 1 4 is configured to control the re- 3S 
spective processing systems by supplying control sig- 
nals thereto based on a user's instruction inputted 
through recording control signal input system 15. e.g., 
a keyboard or the like. 

Recording data processing system 17 (recording 
means) operates to multiplex bit sequences that are 
supplied from video compression system 6, audio com- 
pression system 11, and subcodes generation system 
1 3, and to transfer the multiplexed data to recording me- 
dium 18 and record the data thereon. (It is noted that 45 
some or all of the subcodes may optionally be trans- 
ferred as a block without being multiplexed with the au- 
dio and video data, in which case recording system 17 
just records the block of subcodes on the recording me- 
dium without multiplexing). Recording medium 18 may ^ 
be an optical disk, a hard disk, a memory card, or the 
like. 

FIG. 3 is a flowchart illustrating process steps exe- 
cuted within system controller 14 to control various as- 
pects of the recording process of recording apparatus 55 
100. At the outset (step 81) system controller 14 deter- 
mines an operating mode based on a user's instruction 
input to input system 15, e.g, by detecting depression 
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of a particular mode key In step S2, it is ascertained 
whether the operating mode determined in step SI is a 
normal recording mode. I.e., a mode In which both video 
and audio signals are recorded. If so, the routine pro- 
ceeds to step S3, where system controller 14 sets op- 
eration parameters A, B, C. and D in the audio features 
extraction system 12. For reasons that will become ap- 
parent below, the values of parameters A-D are set in 
accordance with the type of audio signal selected by the 
user, e.g.. audio signal from a television signal, VTR out- 
put or microphone. Thus, the values of parameters A-D 
correspond to the switch ing state of audio switch ing sys- 
tem 9. which is controlled by system controller 14. 

Before proceeding further with FIG. 3, reference is 
made to FIG. 4 which shows a flowchart illustrating a 
routine within audio features extraction system 12 and 
subcodes generation system 1 3. For the presently de- 
scribed embodiment, it is assumed that one data bkx:k 
contains N bits or bytes of audio data, where N Is a pre- 
determined integer. By way of example, one block may 
contain digitized audio data corresponding to a 0.02 
second long segment of the input analog audio signal. 
It is further assumed that subcode A is calculated on a 
bkx^k-by-block basis, and that subcode B is calculated 
on an M-block basis, where M is a specified Integer. In 
step S21 , audio features extraction system 12 receives 
operation parameters A, B, C, and D from system con- 
troller 14, which parameters have been set in accord- 
ance with the type of audio signal selected as discussed 
previously If, in step S22, it is detennined that M blocks 
have not yet been processed, the single block process 
("1 -block process') of step S27 is executed. 

FIG. 5 is a flowchart illustrating the 1 -block process. 
In step S31 , a fast Fourier transform (FFT) is performed 
on a single block of the audio signal to determine the 
spectral components of the portion of the signal corre- 
sponding to that block. Next, in step S32, audio signal 
power is calculated from Nb frequency components that 
are specified by operation parameter B supplied from 
system controller 14. The portion of the input audio sig- 
nal band to be used in calculating signal power is thus 
determined by parameter B. For example, an audio sig- 
nal from a camera system includes a considerable 
amount of low frequency components such as zip, while 
an audk) signal of a television broadcast includes a con- 
siderable amount of components at harmonic frequen- 
cies of the frame frequency Hence, for the signal power 
calculation, noise-Induced errors can be reduced by ap- 
propriately filtering out undesired frequencies in accord- 
ance with the type of audio signal being analyzed. 

In the next step, S33, it is ascertained whether or 
not the signal is mute. That is, if the calculated power 
value is smaller than parameter C, the signal is deter- 
mined to be mute within the associated block. Optional- 
ly, when the computed power is larger than C, a further 
determination can be made as to whether the signal 
power is within one of several predetermined ranges. In 
any event, subcode A is generated in step 534 in ac- 
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cordance with the determination of step S33, and sup- 
plied to the recording data processing system 17. Sub- 
code A will either be of a first predetermined value for a 
mute condition, or one of a number of other predeter- 
mined values each corresponding to a different range 
of signal power levels. In general, signals of a television 
broadcast and of a camera system (e.g., camcorders) 
have different signal to noise (S/N) ratios because of dif- 
ferences in microphone perfomiance. Therefore, the 
possibility of erroneous detection can be reduced by ap- 
propriately selecting the value of parameter C in accord- 
ance with the selected switching position of audio 
switching system 9 (and the control thereof by system 
controller 14). 

The next step in the 1 -block process. S35. is to de- 
termine the spectral peak P(f), i.e., the peak amplitude 
at any one of Nd specified frequencies, where Nd is an 
integer. The spectral peak so determined is then stored 
temporarily in memory circuit 16. The Nd frequencies 
are determined based on the parameter D supplied from 
system controller 14. As discussed above, the spectral 
components that add noise to the audio signal are a 
function of the audio signal type. Accordingly, the peak 
spectral power can be calculated with higher accuracy 
by eliminating those noise components from the subject 
frequency components. 

Once the spectral peak P(f) for the single block is 
computed and stored temporarily, the software flow re- 
turns to steps S21 and S22 of FIG. 4, The process con- 
tinues until step S27 is executed M times whereby spec- 
tral peaks P(f) are computed and stored in memory for 
M blocks of the audio signal. 

Once it is determined in step S22 that M blocks of 
audio data have been processed, then, in step S23, the 
software calculates an average continuous length 
'CL/^yQ* in which the spectral peaks P(f) are determined 
to be of similar levels to one another. The calculation of 
CL^vQ entails comparing the spectral peaks P(f) of a se- 
quence of bbcks to one another and determining seg- 
ment lengths at which the peaks of sequential blocks 
remain within a predefined range of one another. In step 
S24, it is then determined whether the computed value 
of CLavg ^^^^ series of M blocks is larger than pa- 
rameter A supplied from system controller 1 4. In gener- 
al, the average number of blocks for computing CLavg 
is large when the pitch of sound is relatively stable as in 
the case of music. Conversely, the average number of 
blocks is small when the audio signal comprises human 
speech or conversation. For the case of musk:, it may 
be determined that certain values for CL^vg correspond 
to music produced by an Instrument while other values 
correspond to vocal music. 

In any event, in step S25, a subcode B is estab- 
lished for each M-biock segment of the audio signal as 
corresponding to the particular type of audio signal. In 
this example, it is determined whether the signal is mu- 
sic or not based on whether the value CL^vg 's larger 
than parameter A provided by system controller 14, and 



subcode B Is generated accordingly. The subcode B is 
stored in memory circuit 16 in step S26 and the process 
is then repeated for the next M blocks, for as long as the 
operating mode remains the normal recording mode. In 

5 general, signals of a television broadcast and of camera 
system (e.g., camcorder) have different rates of occur- 
rence of on-music items such as conversational speech. 
Therefore, the possibility of erroneous detection can be 
reduced via appropriate selection of the value of param- 

10 eter A in accordance with the type of input audio signal 
selected. 

Returning to FIG. 3, while the audio signal is being 
processed in accordance with the aforedescribed con- 
trol in step S3, the video is continually processed and 
IS digitally recorded as well. That is, In step S4, the com- 
pressed video output signal from video processing sys- 
tem 6 is transferred to recording medium 18 through re- 
cording data processing system 17 via control com- 
mands from system controller 14. System controller 14 
also controls the audio processing system 11 in step 85 
so that a compressed audio signal is transferred to re- 
cording medium 1 8 via recording processing system 1 7. 
In step s6, system controller 14 controls recording 
processing system 1 7 so that the above-discussed sub- 
codes "A" generated by subcodes generation system 1 3 
are supplied to recording processing system 17 and 
transferred to recording disk 1 8. Then, in step S7, if one 
or more subcode B has been generated, subcodes gen- 
eration system 1 3 is instructed to transfer the same to 
memory circuit 16. 

Thereafter, the process returns to steps 81 and 82. 
If the operating mode is still the normal operating mode, 
the aforedescribed process is repeated. If, on the other 
hand, the operating mode has changed such as by user 
depression of a 'stop recording" key or the like, the rou- 
tine proceeds to step 88, where it is ascertained whether 
the previously generated subcodes B have already 
been recorded onto recording medium 1 8. If not, system 
controller 14 controls subcodes generation system 13 
(step S9) so as to read out subcodes B stored in memory 
circuit 1 6 and transfer them to recording medium 1 8 via 
recording data processing system 17. 

In the above manner, when a transition is made 
from the normal recording mode to some other mode, 
subcodes B are recorded as a block onto a predeter- 
mined region of recording medium 18, e.g., on the U- 
TOC region as discussed above. 

If in step 88 the subcodes B have already been re- 
corded on recording medium 18, the next step (step 
S10) is to determine whether the current operating 
mode is a stop mode. If so, a stop process is executed 
in step 812. Otherwise, it Is determined in step 811 
whether the operating mode is a removal mode, and if 
so, a removal mode process is executed in step SI 3, 
and the routine returns to step SI . 

FIGS. 6 and 7 are timing diagrams showing output 
timing for signals of the respective audio and video 
processing systems. FIG. 6 shows output timing in a 
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normal recording mode. As is apparent from the bottom- 
most timing bar of the figure, during the normal operat- 
ing mode, the audio, video and muting data (subcode 
A) are recorded continually on the recording medium on 
a frame by frame basis. In the presently described em- 
bodiment, the compressed video data of the {N-1)st 
frame is recorded first, followed by the compressed au- 
dio data of the (N-1)st frame, then the subcode A for 
frame N-1 , which is fotk)wed by the video data of the Nth 
frame, and so forth. It is understood that different data 
storing sequences may be implemented In the alterna- 
tive. The other timing bars of FIG. 6 depict how the il- 
lustrative recording sequence is implemented. The 
compressed video data of any given frame, e.g., the Nth 
frame, is output from video compression system 6 just 
prior to the compressed audio data being output from 
audio compression system 11 . Sufficient time needs to 
be allocated to perform the aforedescribed "1 -block 
process' on the current frame, i.e., to perform a quad- 
rature transform (e.g., FFT) on the compressed audio 
data, to determine subcode A and the spectral peak P 
(f) as described above for the frame, where one frame 
corresponds to a single block in this example. Thus, the 
quadrature transform for the Nth frame is performed pri- 
or to outputting the compressed audio data of the Nth 
frame, while the generation of subcode A for the Nth 
frame is completed immediately after the compressed 
audio data is outputted. Also. P(f) is stored for each 
frame in memory circuit 16. After M frames have been 
processed, e.g.. four frames in the example of FIG. 6 
(represented by frames N-1 to N+2) then subcode B is 
generated for that M frame block and written to memory 
circuit 16. 

FIG. 7 is a timing diagram showing illustrative out- 
put timing of signals outputted from the respective 
processing systems as transitions are made from a nor- 
mal recording mode to a stop mode, and then to a re- 
moval mode. In this example, it is assumed that the tran- 
sition to the stop mode is effectuated when frame N is 
captured. After the compressed video and audio signals 
and subcodes A corresponding to frames N-1 and N are 
recorded onto recording medium 1 8, all of the subcodes 
B stored in memory circuit 16 are read out by subcodes 
generation system 1 3 and recorded onto recording me- 
dium 18 via the recording data processing system 17. 

The particular sector configuration and format used 
for the subcodes A and B are not critical to the present 
Invention. The following are presented by way of exam- 
ple: 



(continued) 



Example of sector configuration of subcode A: 


Sync pattern 


j 8 bytes 


Subcode 


j 9 bytes 


Parity 


1 6 bytes 


User data 


1 2.048 bytes 


ECC (error correcting code) 


j 256 bytes 



Example of sector configuration of subcode A: 



Total 



2.329 bytes 



10 



IS 



20 



25 



30 



35 



Example of format of subcode A: 



Sector number 
Audio level 
Total 



4 bytes 

5 bytes 
9 bytes 



Example of audio level: 


000 


mute 


001 


level-0 


010 


levet-1 


Oil: 


level -2 


1XX 


level-N 



Example of configuration of user table of contents 
(U-ToC) including the recorded subcodes B: 



Sync pattem | 


8 bytes 


Parity | 


8 bytes 


user data j 


2.048 bytes 


Subcode B 


8.192 bytes 


ECC (error correcting code) | 


256 bytes 


TDtal 1 


10,512 bytes 





Example of format of subcode B: 


40 


0 see's type 


1 byte 


1 see's type 


1 byte 


45 


8,191 see's type 


1 byte 




Total 


8,192 bytes 



50 



55 



In the above example, '0 see's type" represents, for 
instance, the type of audio, e.g., voice, music, etc., that 
will be reproduced during a period of 0 through 1 sec- 
onds from the start of a reproduction. 'I see's type" rep- 
resents the audio type reproduced during a period of 1 
through 2 seconds from the start, and so forth. "8191 
see's type* represents the audio type that will be repro- 
duced during a period of 81 91 to 81 92 seconds from the 
start. For example, the audio types may be defined as 
follows: 



6 
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Example of n see's type: 
000: mute 

001: music-0 (e.g., instrument music) 

010: music-1 (e.g., vocal music) 

011: human voice (e.g., conversational speech) 

1XX: other types 

Accordingly, it should be readily apparent that em- 
bodiments of the Invention such as recording apparatus 
100 just described, are advantageously capable of re- 
ceiving an analog audio or audiovisual program such as 
a broadcast, recording the same digitally while simulta- 
neously analyzing the audio content as it is being re- 
corded, and creating a user table of contents (U-TOC) 
characterizing the different portions of the recorded au- 
dio program. During playback, the user can advanta- 
geously employ the U-TOC (with appropriate electron- 
ics such as those to be described) to find certain portions 
of the recorded material, skip portions with undesired 
audio types, and so forth. Consequently, the user is pro- 
vided with a highly efficient tool during the playback 
process. 

illustrative apparatuses for reproducing audio and 
video that have been stored along with additional audio 
feature information on a digital storage medium in the 
above -discussed manner, will now be described. 

FIG. 8 is a block diagram showing an illustrative 
configuration of an information reproduction apparatus 
200 according to an embodiment of the invention. Re- 
cording medium 18 is similar to that shown in FIG. 1 . e. 
g., an optical disk, memory card, or magnetic hard disk. 
Audio and video data and corresponding subcodes A 
and B characterizing the different time segments of the 
audio, are recorded on the recording medium 18. If the 
recording medium is an optical disk, data can be record- 
ed according to the following format: 



Example of sector configuration: 


Sync pattern 


1 8 bytes 


Subcode 


9 bytes 


Parity 


8 bytes 


User data 


1 2,048 bytes 


ECC (error correcting code) 


256 bytes 


Total 


2.329 bytes 



Example of subcode format: 

Sector number | 4 bytes 
Audio ID I 5 bytes 

Total j 9 bytes 

By way of example, 5 byte audio IDs may be stored 
with the bwest one byte representing an audio level as 
follows: 



XXXXO: level-0 
XXXX1 : level-1 
XXXX2: level-2 
XXXXA: level-N; 

5 

and the second lowest byte represents idio content in 
this example: 

XXXOX: mute 
10 XXXIX: music (pop) 

XXX2X: music (classic) 



IS XXXAX: voice 

tn the above example. X represents an arbitrary val- 
ue of 0 to 255. 

Although the above example is directed to the case 

^0 in which a subcode Is located In the same sector as vid- 
eo and audio data, as an altemative. a given sector may 
contain only subcodes. Further, as in the case of a mini- 
disc (MD), subcodes may be arranged as a batch in a 
given region such as a U-TOC region. For this case, an 

25 apparatus can be implemented by using the same con- 
figuration and processes as in the above example. 

In the following discussion, reproduction apparatus 
200 will be described with the assumption that recording 
medium 1 8 is an optical disk. A driving circuit 21 (in this 

30 case, an optical disk driving circuit) is configured to ser- 
vo-control optical disk 18 to enable specified sectors of 
the disk to be accessed in response to an external con- 
trol signal. An optical pickup (not shown), which may be 
part of reproduction processing system 22, reads out 

35 signals from disk 1 8. and amplifies and demodulates the 
same. Reproduction data processing system 22 oper- 
ates to separate video data, audio data, and subcodes 
from data that is read out from recording medium 18, 
and to provide the subcodes to subcodes detection sys- 

40 iemA28. 

Video signal band expansion processing system 23 
operates to expand the compressed video data supplied 
from processing system 22, and to convert the expand- 
ed data into a baseband signal of, e.g., 1 3.5 MHz. YUV, 

45 or the like. Video signal D/A conversion system 24 con- 
verts received digital video data Into an analog video sig- 
nal. Audio signal band expansion processing system 25 
expands audio data that has been compressed accord- 
ing to the MPEG scheme or the like. Audio signal D/A 

so conversion system 26 converts received digital audio 
data into an analog audio signal. 

Readout region calculation system 27 (control 
means) calculates a sector number of recording medi- 
um 18 based on a control signal sent from system con- 

55 troller 29 or subcodes detection system A 28 (determin- 
ing means). Detection system 28 is configured to deter- 
mine whether subcodes (and associated frames) that 
are read out from recording medium 18 correspond to 
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the audio type of a current reproduction mode. Detection 
system 28 supplies a control signal to calculation sys- 
tem 27 in accordance with this determination. Detection 
system 28 also provides video expansion system 23 and 
audio expansion system 25 with a control signal as an 
instruction to retrain from c-:putting data from a frame 
when that frame is to be skipped. System controller 29 
is operative to control the entire recording apparatus 
200 based on data input by a user through *nput system 
30, e.g., a user panel of depressible selection keys. The 
various systems of apparatus 200, e.g., systems 22, 23, 
25 and 27-29, may be embodied either as separate 
firmware or as part of a common processor with suitable 
software running thereon to realize the functions of the 
respective systems. 

Operation of the above-described apparatus of FIG. 
8 will now be described with reference to the flowchart 
of FIG. 9. In step S41, system controller 29 determines 
an operating mode based on user depression of one or 
more keys of input system 30. The operating mode may 
be selected from a normal reproduction mode in which 
audio and video data are output continuously or, one or 
more "skipping" reproduction modes in which a speci- 
fied audio type is skipped during reproductbn. In S42, 
readout region calculation system 27 calculates a sector 
number of the next subcode to be read out. Next, in step 
S43, the calculated sector number is supplied to driving 
circuit 21 , and the subcode corresponding to the calcu- 
lated sector number is read out from recording medium 
1 B under the control of driving circuit 21 . The calculated 
sector number and associated subcode are supplied to 
detection system 28 via processing system 22. 

Next, in step S44. it is determined whether the cur- 
rent operating nxxJe is the normal reproduction mode, 
and if so, the process flows to step S45. where calcula- 
tion system 27 calculates the sector number of the next 
frame and supplies it to driving circuit 21 . In step S46, 
compressed audio and video data corresponding to the 
next frame are read out from recording medium 18 un- 
der the control of driving circuit 21 . This compressed vid- 
eo and audio data are transferred to video expansion 
system 23 and audio expansion system 25, respective- 
ly via processing system 22 (steps S47, S48). The com- 
pressed video data that has been transferred to video 
expansion system 23 is expanded therein, then convert- 
ed into an analog video signal by video D/A converter 
24, and finally output. The compressed audio data that 
has been transferred to audio expansion system 25 is 
expanded therein, converted intoan analog audio signal 
by audio D/A converter 26. and then output. The routine 
then returns to step S41 to repeat the foregoing process. 

If, in step S44, system controller 29 determines that 
the current operating mode is different than the normal 
reproduction mode, e.g., that the mode is reproduction 
mode A (step S49), or reproduction mode B (step S51), 
then apparatus 200 is controlled to output audio and vid- 
eo data in accordance with the reproduction mode se- 
lected. For instance, the reproduction mode selected by 



the user may be designed to cause apparatus 200 to 
skip one particular type of audio during playback. In this 
case, frames are skipped if their associated subcode 
corresponds to the audio type to be avoided. Detection 

5 system 28 would then instruct expansion systems 23 
and 25 not to output data corresponding to that frame. 
Concomitantly, calculation system 27 is instructed to im- 
mediately skip the sector of that frame and move on to 
subsequent frame sectors until a frame is found having 

10 a different subcode than the one to be avoided. 

Likewise, another reproduction mode may be in- 
cluded to allow for playback of only one type of audio 
while skipping all other types. In this case, detection sys- 
tem 27 provkies "skip" commands as described above 

15 to calculation system 27 and expansion systems 23, 25 
when the current frame subcode does not correspond 
to the audio type selected to be played back. Yet another 
reproduction mode may be Included which implements 
a specific viewing and/or listening speed Inputted by the 

20 user, in which case both video and audio signals can be 
skipped in synchronism with one another by calculating 
a ratio between frames to be reproduced and frames to 
be skipped. 

In the example of FIG. 9. it is assumed that repro- 

25 duction mode A corresponds to a mode in which frames 
with muted or low level audio are to be skipped. If it is 
determined in step S50 that a frame is to be skipped 
because its subcode corresponds to a low audio level, 
then the routine returns to steps S42 and S43 where the 

30 sector for the subsequent frame is calculated, the sub- 
code is read out and the process is repeated. If the frame 
is not to be skipped, the routine returns from inquiry S50 
to step S45 to commence the playback process for the 
audioA/ideo data of that frame. 

35 As described above, a variety of reproduction oper- 
ations can be performed by determining the content of 
a subject subcode in response to a command from sys- 
tem controller 29, and then calculating readout sectors 
based on the determination. With this technique, since 

40 a vkleo signal and an audio signal are always skipped 
or reproduced in synchronism with each other, no timing 
deviation occurs between them. 

FIG. 10 is a timing diagram showing output timing 
of the signals output from respective processing sys- 

45 terns in the normal reproduction mode and in an illus- 
trative reproduction mode A. In the normal reproduction 
mode, every frame is read out irrespective of the sub- 
code value. In reproduction mode A, frames may be 
skipped depending on the read-out subcode value. In 

50 the example of FIG. 10, frames having audio levels of 
level-0 and level-1 are skipped, i.e.. frames in which the 
lowest byte of the illustrative 5-byte audio ID ot the sub- 
code is "0" or "1" are skipped. Thus, frames N+1, N+2, 
and N+4 are skipped and frames N+3, N+5. and N+6 

55 are read out from recording medium 1 8 under the control 
of readout region calculating system 27. Video and au- 
dio signals of the non-skipped frames, i.e., frames N+3, 
N+5, and N+S in this example, are reproduced in syn- 
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chronism with each other. 

FIG. 11 is a block diagram showing another illustra- 
tive configuration of an information reproduction appa- 
ratus 300, which is another embodiment of the inven- 
tion. Reproduction apparatus 300 differs from the 
above -described apparatus 200 of FIG. 8 in that a sub- 
code detection system B 41 in FIG. 11 is substituted for 
the system A 28 in FIG. 8. and a memory circuit 42 (stor- 
ing means) is included In FIG. 11 . Since the other com- 
ponents of apparatus 300 and the operation thereof are 
the same as the corresponding components of appara- 
tus 200, descriptions thereof will be omitted. 

Subcodes detection system B 41 is configured to 
read out subcodes that are recorded on the recording 
medium 18 and then store those subcodes in memory 
circuit 42. Preferably, these subcodes are read out from 
recording medium 18 as a block during an allocated time 
inteival. System 41 also operates to receive a "repro- 
duction mode" control signal from system controller 29 
indicative of which audio data is to be reproduced (or 
skipped), In response, system 41 reads out subcodes 
stored in memory 42 and determines whether to repro- 
duce the audio/video data of a given frame based on a 
comparison of that frame's associated subcode with the 
reproduction mode selected. System 41 then controls 
readout region calculation system 27 in accordance with 
the comparison. 

Memory circuit 42 is a semiconductor memory de- 
vice or the like, such as a random access memory, and, 
by way of example, may be stored with the following 
subcodes: 



Address 


Data 


0000 


Oth-frame subcode 


0001 


Ist-frame subcode 


xxxx 


Nth-frame subcode 



Operation of reproducing apparatus 300 will now be 
described with reference to the flowchart of FIG. 12. At 
the start (step S61), subcodes detection system B 41 
reads out all subcodes stored on recording medium 18, 
and transfers the subcodes to memory 42 for storage. 
The subcode readout process is effectuated by system 
41 providing control commands to calculation system 
27, which in turn provides control signals to driver circuit 
21 for accessing the proper region of the disk. 

Next, in step S62, an operating mode is determined 
based on data input via a user key depression through 
input system 30. In step S63, detection system 41 reads 
out a subcode of a specific frame from memory 42, i.e., 
the next frame in a reproduction sequence to be select- 
ed as a candidate for possible playback of audio/video 
data. If in step S64, the current operating mode is de- 



termined to be the normal reproduction mode, the sub- 
codes are irrelevant since no frames are skipped. In this 
case, calculation system 27 calculates a sector number 
of the next frame and driving circuit 21 is controlled ac- 
5 cordingly (step S65). AudioA^ideo data of the next frame 
is then read out from recording medium 1 8 and supplied 
to reproduction data processing system 22 (step S66). 
Processing system 22 then separates the audio data 
from the video data, transfers the audio data to expan- 
sion system 25 and the video data to expansion system 
23 (steps S67, S68). The signals are expanded in the 
respective expansion systems 23, 25, converted to an- 
alog signals by respective D/A converters 24, 26 and 
then output. The process is then repeated for the sub- 
sequent frames. 

If, in step 864, system controller 29 determines that 
the current operating mode is different than the normal 
reproduction mode, e.g., reproduction mode A (step 
S69) or reproduction mode B (step S71 ), then apparatus 
300 is controlled to output audio and video data in ac- 
cordance with the reproduction mode selected. For in- 
stance, as was the case for recording apparatus 200, 
some of the alternative reproduction modes may be de- 
signed to cause apparatus 300 to skip a particular type 
of audio during playback. In this case, frames are 
skipped if their associated subcode corresponds to the 
audio type to be avoided. Another reproduction mode 
may be included to allow for playback of only one type 
of audio while skipping ail other types. Yet another re- 
production mode may be included which implements a 
specific viewing and/or listening speed inputted by the 
user, as mentioned previously. 

In the example of FIG. 12, if it is detenmined in step 
S70 that a frame is to be skipped based on a positive 
correlation with its subcode and reproduction mode A 
(e.g., mute condition skipping or vocal song skipping, 
etc.) then the routine returns to step S63 where the sub- 
code from the subsequent frame is read out and the 
process is repeated. If the frame is not to be skipped, 
the routine returns to step S65 to commence the play- 
back process for the audkWideo of that frame as de- 
scribed above. 

FIG. 13 is a timing diagram illustrating the timing of 
signals that are output from the respective processing 
systems in making a transition from a normal reproduc- 
tion mode to reproduction mode A. When the apparatus 
is initially powered up or a new optical disk Is inserted, 
etc.. subcodes are initially read out as a block in a sub- 
codes readout mode. In the normal reproduction mode, 
a subcode corresponding to a current frame to be played 
back is read out from memory circuit 42, and video and 
audio data of that frame are read out from recording me- 
dium 1 8. The video data is supplied to and expanded by 
video expansion system 23, then converted into an an- 
alog video signal by video D/A converter system 24, and 
finally output. The audio data is supplied to and expand- 
ed by audio expansion system 25, then converted into 
an analog audio signal by D/A converter 26 for output- 
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ting. 

In reproduction mode A, frannes having subcodes 
indicating audio levels lower than a specified level are 
skipped during playback. In this example, frames whose 
audio levels are lower than level-2 are skipped while 
frames with audio levels higher than levei-1 are repro- 
duced. Since video data and audio data are skipped in 
synchronism with each other, reproduced video and au- 
dio signals are also synchronized with one another. 

It is understood that the above-described embodi- 
ments of recording and reproduction apparatuses can 
be modified in a variety of ways without departing from 
the spirit and scope of the invention. For example, while 
the above embodiments specifically illustrate discrimi- 
nation between two classes of audio -- low level audio 
and music - the embodiments can be modified to allow 
for discrimination annong three or more types of audio. 
Further, Instead of detecting one spectral peak P(f) for 
each block in the computation for discriminating be- 
tween music and non-music, the discrimination may al- 
ternatively be performed by detecting a plurality of spec- 
tral peaks relative to the highest level for each bkx:k, 
and calculating their continuity, e.g., over M blocks. As 
another alternative, the discrimination between music 
and non-music and/or between muted and non-muted 
audio may be made by using one of various, currently 
proposed speech recognition devices, with the discrim- 
ination result being recorded as a subcode. 

Further, while the above embodiments are directed 
to the case in which skips are effected on a frame-by- 
frame basis, in the audio system the amount of noise 
due to switching between frames can be minimized by 
performing cross-fading before and after each skip. Al- 
ternatively, switching can be controlled by detecting ze- 
ro-cross points. 

Moreover, in the above embodiments, playback and 
skipping are controlled on a frame-by-frame basis 
based on subcode contents. However, playback of a 
short audioA/ideo segment, for instance, a one or two- 
frame segment, may be recognized in many cases 
merely as noise. This problem can be solved by setting 
In advance the minimum continuous sequence of 
frames to be played back. Then, frames would be played 
back, rather than skipped, so long as the minimum se- 
quence has not yet been reached, even if their subcodes 
indicate a skip. 

As another modification, subcode A (which is indic- 
ative of the audio level feature) may be generated for 
every two frames rather than for every frame as de- 
scribed. Further, another reproduction mode based on 
subcode A may be included which allows a user to au- 
tomatically skip louder portions (higher levels) of the au- 
dio signal, e.g., loud music, while playing back audio at 
lower levels. 

Further, although the above embodiments are di- 
rected to the application of using subcodes relating to 
audio level and music, various forms of reproduction can 
be realized by generating subcodes indicating other au- 



dio features such as a subcode for identification of a 
speaker. 

While the present invention has been particularly 
shown and described in conjunction with preferred em- 

s bodiments thereof, it will be readily appreciated by those 
of ordinary skill in the art that various changes may be 
made to the disclosed embodiments without departing 
from the spirit and the scope of the invention. Therefore, 
it is intended that the appended claims be interpreted 

10 as including the embodiments described herein as welt 
as all equivalents thereto. 



Claims 

15 

1 . An information recording apparatus for recording at 
least an audio signal onto a recording medium, 
comprising: 

20 detecting means for detecting a feature of the 

audio signal; and 

recording means for recording additional infor- 
mation that corresponds to said detected fea- 
ture onto the recording medium together with 
2S the audio signal. 

2. The information recording apparatus according to 
claim 1. wherein said recording means further 
records a video signal associated with the audiosig- 

30 nal onto said recording medium together with the 
audio signal and said additional information. 

3. The information recording apparatus according to 
claim 1 , wherein said recording means records, in 

35 a distributed manner, the audio signal and said ad- 
ditional information In a common region of said re- 
cording medium. 

4. The information recording apparatus according to 
40 claim 3, wherein said additional information is re- 
corded for each of a plurality of blocks of the audio 
signal. 

5. The information recording apparatus according to 
45 claim 1, wherein said additional information is re- 
corded in a predetermined region of said recording 
medium that is different from a region in which at 
least the audio signal is to be recorded. 

so 6. The information recording apparatus according to 
claim 5, wherein all said additional information is re- 
corded in said predetermined region during a time 
interval in which said audb signal is not being re- 
corded. 

55 

7. The information recording apparatus according to 
claim 1 , wherein the detecting means performs a 
quadrature transform on the audio signal periodical- 
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ly at a predetermined time interval, and detects the 
feature of the audio signal by determining a corre- 
lation between resulting energy components and 
energy distribution. 

8. The information recording apparatus according to 
claim 7, wherein said detecting means detects the 
feature as music if an average continuous length of 
spectral peaks that are within a predetermined am- 
plitude range of one another, is greater than a spec- 
ified value. 

9. The Information recording apparatus according to 
claim 7, further comprising an input switch for re- 
ceiving plural types of analog audio signals and pro- 
viding said audio signal at an output thereof in ac- 
cordance with a selected switching state, and 
wherein said detecting means detects the feature 
of the audio signal as a function of the type of analog 
audio signal selected. 

10. An information recording method for recording at 
least an audio signal onto a recording medium, 
comprising the steps of: 

detecting a feature of the audio signal; and 
recording additional information that corre- 
sponds to the detected feature onto the record- 
ing medium together with the audio signal. 

11. An informatbn reproduction apparatus for repro- 
ducing at least an audio signal corresponding to au- 
dio data recorded on a recording medium on which 
additional information relating to at least the audio 
signal is also recorded, comprising: 

reading means for reading out a portion of the 
additional information prior to any reproduction 
of a corresponding portion of the audio signal; 
determining means for determining whether to 
reproduce said corresponding portion of the au- 
dio signal in accordance with said read-out por- 
tbn of said additional information and a current 
operating mode; and 

control means for controlling reproduction of 
the corresponding portion of the audio signal in 
accordance with a determination by said deter- 
mining means. 

12. The information reproductbn apparatus according 
to claim 11, wherein: 

a video signal corresponding to the audio signal 
is further recorded on said recording medium; 
said reading means reads out the portion of the 
additional information prior to any reproduction 
of corresponding portions of the video signal 
and the audio signal; 
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said determining means determines whether to 
reproduce a portion of the video signal and the 
portion of the audio signal corresponding to the 
read-out portion of the additional information in 
5 accordance with the read-out portion of the ad- 

ditional information and the current operating 
mode; and 

said control means controls reproduction of the 
portions of the video signal and the audio signal 
10 in accordance with the determination by said 

determining means. 

13. The information reproduction apparatus according 
to claim 1 2, wherein said control means controls the 

IS reproduction so that the video signal and the audio 
signal are reproduced in synchronism with each 
other. 

14. The Information reproduction apparatus according 
20 to claim 1 2, wherein the additional Information Is re- 
corded in a distributed manner in a region of the re- 
cording medium where the video signal and the au- 
dio signal are recorded. 

25 15. The information reproduction apparatus according 
to claim 14, wherein said audio signal and associ- 
ated video signal are recorded on the recording me- 
dium in blocks, and the additional informatbn is re- 
corded for each block of the video signal and the 

30 audio signal so recorded. 

16. The information reproduction apparatus according 
to claim 1 2. wherein the additional information is re- 
corded in a predetermined region of said recording 

3S medium that Is different from a region in which the 
video signal and the audio signal are recorded. 

17. The information reproduction apparatus according 
to claim 1 6, wherein said reading means reads out 

40 all said additional information as a block prior to any 
reproduction of said audio and video signals. 

18. The information reproduction apparatus according 
to claim 17, further comprising storing means for 

45 storing the additional information that has been 
read out as a block by said reading means, wherein 
said determining means is operable to determine, 
as a function of a portion of the additional Informa- 
tion stored in the storing means, whether to repro- 

50 duce portions of the video signal and the audio sig- 
nal corresponding to the portbn of the additional in- 
formation. 

19. The information reproduction apparatus according 
55 to Claim 1 2. wherein the additional information indi- 
cates a level of the audio signal. 

20. The information reproduction apparatus according 
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configured to detect at least one said feature select- 
ed from the group consisting of an audio power level 
and a music characteristic. 

5 27. The recording apparatus according to claim 23. 
wherein said feature information is recorded in a 
predetermined region of the recording medium that 
is different from a region in which at least the audio 
signal is recorded. 

10 

28. The recording apparatus according to claim 27, 
wherein all of said detected features are recorded 
in the predetermined region during a time interval 
in which said audio signal is not being recorded on 
IS said recording medium. 
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to claim 1 2, wherein the additional information indi- 
cates a type of the audio signal. 

21 . The information reproduction apparatus according 
to claim 1 2, wherein said control means controls the 
reproduction of the video signal and the audio signal 
so that a ratb between portions of the video and 
audio signals that are reproduced and portions of 
the video and audio signals that are not reproduced 
becomes a specified value. 

22. An infonnation reproduction method for reproduc- 
ing at least an audio signal corresponding to audio 
data recorded on a recording medium on which ad- 
ditional information relating to at least the audio sig- 
nal is also recorded, comprising the steps of; 

reading out a portion of the additional informa- 
tion prior to any reproduction of a correspond- 
ing portion of the audio signal; 
determining whether to reproduce the portion 
of the audio signal corresponding to the read- 
out portion of the additional information in ac- 
cordance with the read-out portion of the addi- 
tional infornnation and a current operating 
mode; and 

controlling reproduction of said corresponding 
portion of the audio signal in accordance with 
the determining step. 

23. A recording apparatus for digitally recording at least 
an audio signal onto a recording medium, compris- 
ing: 

an audio features extraction system configured 
to detect a feature of each of a plurality of time 
segments of the audio signal; and 
a recording processing system for recording 
feature information identifying said detected 
feature of each said time segment of the audio 
signal onto the recording medium together with 
data corresponding to the audio signal. 

24. The recording apparatus according to claim 23. 
wherein the recording processing system is further 
operative to record a video signal corresponding to 
the audio signal onto the recording medium togeth- 
er with the audio signal and said feature informa- 
tion. 

25. The recording apparatus according to claim 23, 
wherein the recording processing system records, 
in a distributed manner, said feature information in 
a region of the recording medium in which at least 
the audio signal is to be recorded. 

26. The recording apparatus according to claim 23, 
wherein said audio features extraction system is 



29. The recording apparatus according to claim 24, 
wherein each of said time segments comprises at 
least one frame of the audio and video signal. 

20 

30. The recording apparatus according to claim 29, 
wherein said audio features extraction system is op- 
erative to detect an audio level feature for each of 
a first predetermined set of frames and to detect an 

25 audio type feature for each of a second predeter- 
mined set of frames larger than said first predeter- 
mined set of frames. 

31. The recording apparatus according to claim 30, 
30 wherein said first predetermined set of frames com- 
prises a single frame. 

32. The recording apparatus according to claim 23, fur- 
ther comprising in combination therewith, a repro- 
of duction system for selectively reproducing said time 

segments of said audio signal based on a correla- 
tion of said feature information for individual ones 
of said segments and a selected reproduction mode 
associated with at least one of said features. 

40 

33. The recording apparatus according to claim 32 
wherein said selected reproduction mode is a mode 
in which only audio signals having a particular fea- 
ture are reproduced while other audb signals are 

4S skipped. 

34. The recording apparatus according to claim 32 
wherein said selected reproduction mode is a mode 
in which only audio signals without a particular fea- 

50 ture are reproduced while other audb signals are 
skipped. 

35. A recording method for digitally recording at least 
an audb signal onto a recording medium, compris- 

55 ing the steps of: 

detecting a feature of each of a plurality of time 
segments of the audio signal; 
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generating feature information identifying said 
detected feature of each said tinne segment of 
the audio signal; and 

recording said feature information onto said re- 
cording med lum together with data correspond- s 
ing to the audio signal. 

36. An Information reproduction apparatus for repro- 
ducing at least an audio signal corresponding to au- 
dio data recorded on a recording medium on which 
feature information relating to at least the audio sig- 
nal are recorded, comprising: 

a data reading system configured to read out a 
portion of the feature information prior to any '5 
playback of a corresponding portion of the au- 
dio signal; 

processing circuitry operative to determine 
whether to reproduce said corresponding por- 
tion of the audio signal in accordance with said 
read-out portion of the feature Information and 
a current operating mode; and 
a controller for controlling reproduction of the 
portion of the audio signal in accordance with 
a determination by said processing circuitry. 2S 

37. The information reproduction apparatus according 
to claim 36, wherein: 

a video signal corresponding to the audio signal 30 
is further recorded on the recording medium; 
said data reading system reads out the portion 
of the feature information prior to any playback 
of the corresponding portion of the video signal 
and the audio signal; 3S 
said processing circuitry determines whether to 
reproduce a portion of the video signal and the 
portion of the audio signal corresponding to the 
read-out portion of the feature Information in 
accordance with the read-out portion of the fea- 40 
ture information and the current operating 
mode; and 

said controller controls reproduction of the por- 
tions of the video signal and the audio signal in 
accordance with the determination by said ^ 
processing circuity. 

38. The information reproduction apparatus according 
to claim 37, wherein said controller controls the re- 
production so that the video signal and the audio 50 
signal are reproduced in synchronism with each 
other. 

39. The information reproduction apparatus according 

to claim 37, wherein said feature Information is re- 5S 
corded in a distributed manner In a region of the re- 
cording medium in which the video signal and the 
audio signal are recorded. 
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40. The information reproduction apparatus according 
to claim 37, wherein said audio signal and associ- 
ated video signal are recorded on the recording me- 
dium in blocks, and said feature information is re- 
corded for each block of the video signal and the 
audio signal so recorded. 

41. The Information reproduction apparatus according 
to claim 37, wherein said feature information is re- 
corded for each set of a plurality of frames of the 
video and audio signals recorded on the recording 
medium. 

42. The information reproduction apparatus according 
to claim 37, wherein the feature Information is re- 
corded in a predetermined region of the recording 
medium that is different from a region where the vkJ- 
eo signal and the audio signal are recorded. 

43. The information reproduction apparatus according 
to claim 42, wherein the data reading system is con- 
figured to read out said feature information during 
an allocated time Inten^al in which no audio signal 
is reproduced. 

44. The information reproduction apparatus according 
to claim 43, further comprising a memory for storing 
the feature information that has been read out by 
said data reading system, wherein the processing 
circuitry determines, based on a portion of the fea- 
ture information stored in the memory, whether to 
reproduce portions of the video signal and the audio 
signal corresponding to the portion of the feature 
informatbn. 

45. The information reproduction apparatus according 
to claim 37, wherein the feature information indi- 
cates a level of the audio signal. 

46. The information reproduction apparatus according 
to claim 37, wherein the feature information indi- 
cates a type of the audk) signal. 

47. The information reproduction apparatus according 
to claim 37, wherein the controller controls the re- 
production of the video signal and the audio signal 
so that a ratio between portions of the video and 
audio signals that are reproduced and portions of 
the video and audio signals that are not reproduced 
becomes a specified value. 

48. The information reproduction apparatus according 
to claim 37, further comprising an input system for 
enabling a user to select a reproduction mode as- 
sociated with at least one feature of the audio sig- 
nal. 

49. The information reproduction apparatus according 
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to claim 48 wherein said reproduction mode is a 
mode in which only audio signals having a particular 
feature are reproduced while other audio signals 
are skipped. 

50. The information reproduction apparatus according 
to claim 48. wherein said reproduction mode is a 
mode in which only audio signals without a particu* 
lar feature are reproduced while other audio signals 
are skipped. 
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